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ABSTRACT 

We report here a new directionaf cDNA library 
construction method using an in vitro site-specific 
recombination reaction, based on the integrase- 
excisionase system of bacteriophage X. Preliminary 
experiments revealed that in vitro recombinational 
cloning (RC) provided important advantages over 
conventional ligation-assisted cloning: it eliminated 
restriction digestion for directional cloning, gener- 
ated low ievels of chimeric clones, reduced size bias 
and, in our hands, gave a higher cloning efficiency 
than conventional ligation reactions. In a cDNA 
cloning experiment using an in vitro synthesized 
long poly(A) + RNA (7,8 kb), the RC gave a higher full- 
length cDNA clone content and about 10 times more 
transformants than conventional ligation-assisted 
cloning. Furthermore, characterization of rat brain 
cDNA clones yielded by the RC method showed that 
the frequency of cDNA clones >2 kb having internal 
A/ofl sites was -6%, whereas these cDNAs could not 
be cloned at all or could be isolated only in a trun- 
cated form by conventional methods. Taken 
together, these results indicate that the RC method 
makes it possible to prepare cDNA libraries better 
representing the entire population of cDNAs, without 
sacrificing the simplicity of current conventional 
ligation-assisted methods. 

INTRODUCTION 

Comprehensive cDNA analysis plays a critical role in comple- 
menting genomic information, and construction of high-quality 
cDNA libraries is essential for meaningful cDNA analysis. The 
information obtained from mammalian cDNA analysis, for 
example, is often indispensable in reliably predicting protein- 
coding sequence from genome sequence. Given the impor- 
tance of comprehensive analysis of human cDNAs, for the past 
six years we have implemented a cDNA sequencing project 
that aims to accumulate predicted protein sequences encoded 
by cONAs (1,2). The total number of nucleotide residues 
determined by our project exceeds 7 Mb to date (2). Our 
project is distinct from others in that we focus our sequencing 
efforts on large cDNA inserts (>4 kb) (3) and the results of the 



analysis of large cDNA clones arc publicly accessible at http:// 
www.kazusa.orjp/huge (4). However, through this experience 
of cDNA analysis, we realized that many technical problems in 
cDNA library construction still remain, especially for analysis 
of large cDNAs, For instance, conventional cDNA cloning 
protocols typically include the use of a restriction enzyme 
cleavage step for directional cloning (5), and thus risk over- 
looking cDNA clones that happen to have an internal restric- 
tion site that is also cleaved by the same restriction enzyme. 
Because the frequency of large cDNAs being digested with a 
particular restriction enzyme should be higher than that of 
shorter cDNAs, this risk becomes greater as the cDNA size 
increases. Further, chimeric clones, which harbor inter- 
moJccularly ligated cDNA inserts, were found at the rate of 
-3% or more in human large cDNA clones and were almost 
impossible to identify by sequencing of cDNA ends only 
(insert sizes between 5 and 7 kb; 6), whereas they are a negli- 
gible component in a pool of small cDNA clones (7,8). Also a 
concern in analysis of large cDNAs is that conventional cDNA 
libraries under-represent large cDNA clones due to severe size 
bias, in the ligation reaction, as well as during propagation of 
the cDNA plasmid population in Escherichia coll Although 
these concerns may not be serious in conventional cDNA 
cloning where multiple clones are isolated for a single gene, 
they are highly problematic for large-scale sequencing of 
cDNAs in which clones are randomly sampled on a single- 
clone-for-s ingle-gene basis. 

We considered that nearly all of these cloning problems resulted 
from using ligation-assisted cloning (LC) to construct the cDNA 
library. Very recently, a new cloning method which exploits the X- 
att recombination system of bacteriophage X in Kcoli has been 
reported (9,10). Briefly, the recombination-assisted cloning (RC) 
method transfers DNA segments into vectors using site-specific 
recombination, instead of restriction enzymes and ligase. hi one 
application, PCR products that are flanked by attBl and attB2 
sequences are efficiently transferred by recombination into 
plasmid vectors containing attPl and attP2 sequences. Recombi- 
nation between attB and attP sequences {BxP) is highly specific 
(11), and mediated by recombination proteins Int and 1HF. The 
product of the BxP reaction is a properly oriented subclone of the 
PCR product within the vector donated by the attP plasmid. The 
subcloned PCR product is now flanked by attL sequences, and in 
this form it can be efficiently transferred to other vectors, if 
desired, using the reverse reaction (LxR) of the X-att recombina- 
tion system. Additional details about the RC method can be found 
athttp://www2.1ifetechxom/content/world/gatewayman.pdf. 
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We thought that use of the same BxP recombination reaction 
could produce cDNA libraries (from double-stranded cDNA 
prepared with flanking attB sequences), and that this approach 
could solve many of the drawbacks associated with conven- 
tional cDNA library construction. In particular, RC of cDNA 
should eliminate the need for restriction enzyme treatment for 
directional cloning. It should also substantially lower the 
frequency of chimeric inserts, since terminal attB recombination 
sequences on the cDNA react only with their respective attP 
sequences, and not with each other (11). 

In the present study, we first characterized the RC method 
for cDNA cloning with respect to chimerization, size bias and 
cloning efficiency. Then we applied RC to directional cDNA 
library construction and analyzed the resultant libraries. The 
results indicated that cDNA library construction based on RC 
significantly reduces the major drawbacks of conventional 
cDNA cloning methods. In addition, this approach provides a 
new, simple route to prepare cDNA clones that arc readily 
transferred to other vectors, using RC, for further studies. 

MATERIALS AND METHODS 

Materials 

The reagents required for the RC system were obtained from 
Life Technologies as a field test kit. Comparable reagents are 
now available commercially from Gibco BRL, Oligonucleotides 
purified on polyacrylamide gels were purchased from Amer- 
sham Pharmacia Biotech, Inc., and used as adapters and an 
adapter-primer for cDNA synthesis. 

Donor (attP) plasmids carrying an ampicillin-resistance gene 
were constructed from pSP73 (Promega) and pSPORTl (Life 
Technologies), In brief, an attP cassette containing a ccdB gene 
and cbloramphenicol-resistance gene was prepared from 
pDONR201 plasmid by PCR. The amplified region corresponds to 
the nucleotide sequence from 10 to 2643 in pDONR201 (accessible 
via the World Wide Web at http;//w wwJifetech.com/content/ 
world/gatewayman.pdf), and flanked by two restriction sites 
included in the primers, Hindlll (attPl side) and Sad (attP2 
side). The attP cassette (2.7 kb) was first subcloned into the Hin- 
ffll-Sacl site of pSP73. The attP cassette subcloned into pSP73 
was excised by digestion with Xhol and BglU and then subcloned 
into the Sall-Bomlil site of pSPORT-L The resultant donor 
plasmids, which confer ampicillin resistance to Kcoli cells, were 
named attP-pSP73 (5.1 kb) and attP-pSPORT-1 (6.8 kb), and 
were prepared from DB5 cells [DHiOB: gyrA462 endA A(srl- 
m;v4)1398], provided in the field lest kit from Life Techno- 
logies. Similarly, to create a plasmid for cloning with LC, an 
attR cassette containing the ccdB and chloramphenicol-resistance 
genes was prepar ed from the plasmid pDESTl (supplied with the 
Life Technologies Field Test Kit) by PCR using 5'-CAT- 
CGCGAGGTACCAAGCTTTC-3' and 5'-CCGCCAAAACA- 
GCCGAGCTCTC-3'. Since the amplified attR cassette was also 
flanked by two artificially generated restriction sites for HindlU 
and Sad, it was subcloned into the Hindlll-Sad site of pSP73, 
as in the case of the attP cassette described above. The resultant 
plasmid was called attR-pSP73 (4.8 kb). 

Rat p-spectrin HI (PSp3) poly(A)* RNA (7.8 kb) .used in a 
model cDNA cloning experiment was prepared from a Notl- 
linearized cDNA clone (12) by in vitro RNA synthesis 
(MEGAscript™, Ambion). Rat brain poly(A) + RNA was 



obtained from Clontech or prepared from total rat brain RNA 
using a jiMACS mRNA isolation kit (Miltenyi Biotech). 

Examination of size bias and chimerization by LC and RC 

A part of the multiple cloning site in pBluescript II SK (Strata- 
gene) was amplified by PCR using the following two primers: 
S'-TGGGTACC ACAAGTTTGTACAAAAAAGCAGGCTC - 
GACGGTATCGAT-3' and 5 / -CTGGAGCTC ACCACTTTG- 
TACAAGAAAGCTGGGT GGCGGCCGCTC-3', where the 
attB 1 and attB 2 sequences of the forward and reverse primers, . 
respectively, are underlined. After digestion with Kpnl and 
Sacl, the resultant PCR product was subcloned into the Kpnl- 
Sacl site of pBluescript II SK. The resultant plasmid, attB- 
pBluescript, was used as a template for preparation of a DNA 
fragment containing multiple cloning sites (MCS). MCS (128 
bp) was obtained by PCR using attB-pBluescript and two 
primers (5 / -GGGGACAAGTTTGTACAAAAAAGCAGGC-3 / 
and 5'-GGGGACCACTTTGTACAAGAAAGCTGG-3'). DNA 
fragments containing the tetracycline-resistance gene (Tet) and 
beta-galactosidase gene (pGal) were amplified from control 
plasmids included in the field test kit provided by Life Tech- 
nologies using the same set of PCR primers. MCS, Tet and 
pGal thus obtained were digested with BsrGl (New England 
BioLabs), which cleaves at the TGTACA sequence within the 
common 15 bp core region of each att sequence, leaving four- 
base 5' overhangs. The products with or without B.yj-GI diges- 
tion were finally purified using the Concert Rapid PCR Purifi- 
cation System (Life Technologies) and then used for LC or 
RC, respectively. To prepare a vector for LC, attR-pSP73 
plasmid was digested with BsrGl and dephosphorylated with 
bacterial alkaline phosphatase. The linearized vector portion of 
attR-pSP73 was purified on an agarose gel, recovered using 
CONCERT Gel Extraction System (Life Technologies) and 
quantified by absorption at 260 nm. The quantities of MCS, 
Tet and jiGal DNA were estimated from fluorescent band 
intensities on an agarose gel with a Fluorolmager 595 (Molecular 
Dynamics). Ligation reactions were carried out in 20 |JLl 
containing lx T4 DNA ligase buffer (Life Technologies), 
45 ng of dephosphorylated, Rv/Crl-digestcd vector portion of 
attR-pSP73, 90 ng of RyrGI-digested (3Gal, 45 ng of BsrGl- 
digested Tet and 5 ng of ftyrGI-digested MCS with T4 DNA 
ligase (I U) at 25°C for 3 h. Recombination reactions were 
performed in 20 ul, containing 500 ng of attP-pSP73, 90 ng of 
fJGal, 45 ng of Tet and 5 ng of MCS, at 25°C for 3 h as 
instructed in the field test kit, and then quenched by the 
incubation of proteinase K (0.2 pg/pl) at 37°C for 10 mim The 
reaction products were purified by phenol extraction followed by 
ethanol precipitation and introduced into Kcoli cells by electro- 
poration (ElectroMAX DH10B™ cells, Life Technologies). The 
transformation efficiency of ElectroMAX DPI 1 0B cells was 
1.0-1.5 x 10'° c.f.uVjlg of pUC19. Electroporated Kcoli cells 
were plated on LB agar plates containing ampicillin (100 pg/ml). 
Plasmids were recovered from transformants (>10 3 ) on ampi- 
cill in-containing agar plates after growth in 100 ml of 2x YT 
medium for 6 h at 37°C. Plasmid preparation was carried out 
using a Concert High purity Plasmid Maxiprep System (Life 
Technologies). The obtained plasmids were digested with 
BsrGl, run on an agarose gel and stained with SYBR-Green 1 
fluorescent dye (Molecular Probes). The fluorescent gel image 
was scanned with a Fluorolmager 599 and analyzed with 
ImageQuant software (Amersham Pharmacia Biotech). For the 
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chimerization test, transformants on ampicill in-containing 
plates (~10 4 ) were cultured in 100 ml 2x YT medium in the 
presence of ampicillin (100 u,g/ml) and tetracycline (25 pg/ml) 
at 37°C for 6 h. After the culture, plasmids were prepared and 
analyzed as described above. 

Cloning of ^-spectrin III cDNA by RC and LC 

cDNA synthesis from in vitro synthesized p-spectrin III 
(PSp3) mRNA was carried out according to the supplier's 
instructions in the Superscript m Plasm id System for cDNA syn- 
thesis (Life Technologies). The first-strand synthesis was primed 
with an oligonucleotide containing dT 20 , the attB2 site (under- 
lined) and a Afofl-recognition site [attB2 dT primer, 5'-GCGA- 
AGCCC ACCACTTTGTACAAGAAAGCTGGGC GGCCGC 
(T) 20 -3']. The other adapters included an attBl site (under- 
lined) and a protruding 5'-end for annealing to the Sall- 
digested end of the cloning vector (upper strand of attBl 
adapter; S'-TCGACGCGT ACAAGTTTGTACAAAAAAGC- 
AGGCTCTTC-3'; lower strand of attBl. adapter: 5'-pGAAG- 
AGCCTGCTTTTrrGTACAAACTTGT ACGCG-3^. After 
adapter ligation, the cDNAs were separated from unligated 
attBl adapters by precipitation with polyethylene glycol 
(PEG), as follows. After precipitation with ethanol, the reac- 
tion products were dissolved in 50 p.1 of 10 mM Tris-HCl 
pH 8.0 and i mM EDTA. After addition of 30 pi PEG solution 
(20% PEG6000 and 2.5 M NaCl), the reaction products were 
incubated on ice for 15 min and then collected by centrifuga- 
tion at 1 8 000 g for 1 0 min. The adapter-ligated cDNAs were 
further size-fractionated by gel electrophoresis using low 
melting temperature agarose (13). The recovered |3Sp3 cDNAs 
(>4 kb) were subjected to the recombination reaction with BP 
Clonase (Int 4- 1HF, Life Technologies) at room temperature 
overnight. In the recombination reactions (20 pi), -20 ng 
cDNA and 570 ng attP-pSFORT-1 were included. After the 
recombination reaction, the reaction was quenched by incuba- 
tion with proteinase K (0.2 pg/pl) at 37°C for 10 min. As a 
control reaction, [5Sp3 cDNA ligated with phosphorylated 
attBl adapter was digested with Noti just before agarose gel 
electrophoresis, the recovered cDNAs >4 kb from an agarose 
gel (-20 ng) were subjected to LC with Afoti-&t/I-digested 
pSPORT-I vector (50 ng) and T4 DNA ligase ( i U) in 20 pi at 
16°C for 16 h. Both of the reaction products were extracted 
with phenol, precipitated once with ethanol and then intro- 
duced into Kcoli cells by electroporation (ElectroMAX 
DH10B™ cells, Life Technologies). 

To characterize the resultant cDNA clones, the recovered 
plasmids were digested with Noti alone or Notl plus EcoRl, 
run on an agarose gel and stained with SYBR-Green 1 
(Molecular Probes). After the gel image was captured with 
Fluorolmager 599 (Amersham Pharmacia Biotech), the 
digested plasmids were subjected to hybridization analysis by 
the method of Church and Gilbert (14) using a 32 P-labeled 
oligonucleotide probe. An oligonucleotide probe specific to 
the 3' extreme end of (3Sp3 cDNA (5'-GATTAAACAGCTCG- 
GCTTCTC-3') was SMabeled with 32 P-ATP and T4 polynucle- 
otide kinase. After hybridization at 45°C overnight, the 
membrane on which the digested plasmids were immobilized 
was extensively washed with 2x SSC at 42°Q The radioactive 
hybridization signals remaining on the membrane were quanti- 
tatively detected along the lanes with a BAS-2000 image 
analyzer (Fuji Film). 



Characterization of rat brain cDNA clones generated by RC 

After size fractionation on agarose gel to remove cDNAs 
<1 kb> rat brain cDNAs were used for cDNA library construc- 
tion by RC, essentially as described for |3Sp3 cDNA. attP- 
pSP73 was used as a donor plasmid in this experiment. Rat 
brain cDNA clones obtained by RC (~10 6 colonies) were 
growu on agar plates containing ampicillin (50 Jig/ml) and the 
plasmids were recovered from these colonies. The plasmids 
were then size- fractionated on agarose gels and those with 
insert sizes between 3.5 and 4.5 kb were retrieved from the 
gels. The recovered plasmids were then reintroduced into 
Kcoli cells by electroporation, and trans formants were 
randomly picked and inoculated in 0,8 ml of 2x YT medium. 
After growth at 37°C overnight with vigorous shaking, plas- 
mids were prepared from 192 trans formants using 96-well 
plates (as described in Tech Note Lit, No., TN004, Ivlillipore). 
The resultant plasmids were digested with Notl and run on 
agarose gels. The gel images were captured and analyzed on a 
Fluorolmager 595. 

For a direct comparison of LC- and RC-assisted cDNA library 
construction methods, rat brain cDNA libraries were prepared 
by LC and RC from the same preparation of double-stranded 
cDNAs generated using an attB2 dT primer. The cDNAs 
subjected to LC were ligated with a Sail adapter provided in the 
Superscript™ Plasmid System for cDNA synthesis (Life Tech- 
nologies) while the cDNAs for RC were ligated with an attB 1 
adapter as described above. The rat brain cDNAs used for LC 
were digested with Notl and then size-fractionated on an agarose 
gel to remove cDNAs <I kb, while those used for RC were 
directly size-fractionated in the same way. The amounts of the 
recovered cDNAs were estimated from fluorescent intensity of 
DNA bands on agarose gels. LC and RC were done using -20 ng 
of rat brain cDNAs in 20 uJ reaction mixture as described for 
pSp3 cDNA cloning. To characterize the resultant cDNA 
clones, a mixture of plasmids was recovered from more than 
9 x I0 4 colonies in each cDNA library. The recovered plasmids 
were digested with Notl, run on an agarose gel and stained with 
SYBR-Green I (Molecular Probes). Gel images were captured 
with a Fiuorolmager 599 and analyzed with ImageQuant 
software. 

Sequencing of clones generated by RC 

pSp3 cDNA clones and rat brain cDN A clones were sequenced 
with attLl sequencing primer (5'~CTGAAGCTTGGATCTC- 
GGGC-3') and attL2 sequencing primer (5 '-GCC AG AGCTG- 
CAGCTGGATG-3') using a BigDye terminator cycle 
sequencing kit (vl.0, Applied Biosystems) with an ABI373 or 
AB1377 DNA sequencer (Applied Biosystems). The quality of 
the sequencing data was enough for characterization of cDN A 
clones, but not high. This is most likely due to the fact that 
cDNAs are flanked by highly homologous 100 base attL sites, 
which is known as the TCR suppression effect' (15). 

RESULTS 

Size bias and chimerization in the RC system 

To compare size dependence of cloning efficiency and chimer- 
ization rate between LC and RC, we carried out a preliminary 
cloning experiment using three different DNA fragments 
(0. 1 3, L4 and 3 .0 kb) generated by PGR: the 0. 1 3 kb fragment 
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was prepared from a MCS of pBluescript vector; the 1.4 kb 
fragment contained a tetracycline- resistance gene (Tet); the 
3.0 kb fragment encoded beta-galactosidase (pGal). Each frag- 
ment was prepared with terminal attB sequences, as substrates 
for RC, Each attB sequence also comprised a recognition site 
for the restriction enzyme RyrGI, which leaves four-base 
cohesive termini suitable for cloning by LC. Vectors of similar 
size and derivation were used for the cloning, with the main 
difference being that the LC vector was digested first with 
BsrG\. Equi molar amounts of the three DNA fragments were 
mixed with the corresponding vector in a single tube and 
subjected to either the RC or the LC reaction. When the reaction 
products were introduced into E.coii cells by electroporation 
using ElectroMax Dill OB cells (~10 l0 c.f.u./|Xgof pUC!9), LC 
and RC yielded 3.0 x 10 s and 1.4 x 10 9 c,f.u./|ig of insert DNA, 
respectively. In several similar experiments, the absolute 
cloning efficiencies varied depending on transformation 
efficiency of competent cells used, but RC was always more 
efficient than LC. These results indicate that RC is nearly five 
times more efficient than LC under these cloning conditions. 
Since these cloning efficiencies are higher (by at least three 
orders of magnitude) than the background transformation effi- 
ciencies of LC and RC, virtually all the clones harbored inserts. 
Both cloning reactions contained the same molar amounts of 
DNA fragment, so the size distribution of cloned inserts should 
reflect the respective cloning size bias. The recovered plasmids 
from >10 5 colonies were linearized by digestion with BgUl and 
then run on an agarose gel (Fig. 1 A). From fluorescent signal 
intensities of DNA bands, we estimated the relative molar 
contents of plasmids carrying MCS, Tet and pGal to be 
100: 1 1 :0.5 for RC and 100:4:<0.1 for LC. Thus, although both 
methods produced size bias, these results demonstrate that RC 
caused considerably less size bias than LC. If we take the 
difference in cloning efficiency between LC and RC into 
account, the yield of clones carrying large DNA inserts 
produced by RC is clearly better than that by LC. 

In the next experiment, we examined occurrence of chimeric 
(i.e., inlermolecularly ligated) clones, Ampicillin-resistant 
clones (~ 1 0 4 colonies) obtained by LC or RC were cultured in 
liquid medium containing ampicillin and tetracycline, and the 
plasmids were then recovered from the E.coii cells. If no inter- 
molecularly ligated molecules are cloned, the resultant plas- 
mids should harbor only Tet since neither MCS nor pGal 
conferred tetracycline resistance on E.coii cells. The gel 
images of the recovered plasmids after digestion with BsrGV 
reveal that this was the case for RC, whereas the plasmids 
recovered from clones yielded by LC obviously contain inter- 
molecularly ligated products, where MCS is fused with Tet 
through the BsrG\ site (Fig. IB). These results demonstrate 
that RC produces virtually no chimeric clones, as expected. 

cDNA library construction using an in vitro synthesized 
RNA by the RC system 

For directional cDNA cloning by RC, slight modifications to 
conventional cDNA library construction methods are required. 
Since cDNA library construction based on the method of 
Gubler and Hoffman (5) is widely used, we modified this 
method to replace the ligation step with RC for insertion of 
cDNAs into a vector. Figure 2 shows the overall scheme of the 
RC-assisted cDNA library construction method. In this 
scheme, we used an attB 1 adapter with 5 '-end phosphorylated 
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Figure 1. Comparison of cloned plasmids produced from an equimolar mixture 
of three DNA fragments of different sizes by RC and LC. Plasmids retrieved 
from transform ants generated by RC and LC were digested with BglU (A) or 
BsrGl {B), and then run on 0.7 or 1.2% agarose gels, respectively. Plasmids 
were linearized by BrM digestion, and inserts were excised from the vector 
(pSP73) by BsrGl digestion. Because the resultant clones by RC contained 
recombination site sequences in the vector portion* the size of the RC pktsmid 
became slightly larger than tbatof tbe LC plasmid even when it carried (he same 
insert. In (B), contaminating E.coii chromosomal DNA appeared as the upper- 
most band in both lanes and the excised fJGal fragment, which should run 
slightly slower than tbe vectors, was not seen in either lane. 

lower strand, as in the case of conventional ligation-assisted 
cloning. Although ligation of phosphorylated attB 1 adapter 
was anticipated to yield a considerable number of products 
flanked by two attB 1 adapters, as shown in the box in Figure 2, 
only the attB 1 adapter at the 5'~end should serve as a produc- 
tive recombination site for cDNA cloning. This was confirmed 
by the actual cloning experiments described below. 

In the first experiment, we applied the overall scheme 
depicted in Figure 2 for cloning of long mRNA to compare the 
cloning efficiency and the size bias of RC with those of LC. In 
this case, long poly(A)-tailed RNA (7.8 kb) was synthesized 
from rat pSp3 cDNA (12) and its integrity was confirmed on a 
formaldehyde-denaturing agarose gel (data not shown). 
Double-stranded cDNAs were prepared using the in vitro 
synthesized RNA as a template and then separate cloning 
reactions were performed using two different ^Sp3 cDNA 
preparations to compare the resultant cDNA libraries; double- 
stranded cDNAs ligated with phosphorylated attBl adapter 
with and without Nofl digestion. Before cloning, both of the 
reaction products were size-fractionated on an agarose gel to 
enrich for cDNAs >4 kb. The cDNA preparation without Nofl 
digestion was subjected to RC, while the cDNA with Notl 
digestion was used for conventional LC cDNA library 
construction, by ligation into M?fKS«/l-digested pSPORT-L 
When the reaction products of RC and LC were introduced into 
E.coii cells by electroporation, the cloning efficiencies by RC 
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First-strand cDNA synthesis with aHB2-dT primer 
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Figure 2. A scheme of cDNA library construction based on RC. Steps involved 
in RC-based cDNA library construction arc schematically shown and were 
almost identical to those described in the instructions of Superscript Plasmid 
System (Life Technologies) except for the use of the attB2-dT adapter primer, attB 1 
adapter and inclusion of an in vittv recombination reaction. Oligonucleotide 
sequences used for the attB2-dT primer adapter, upper and lower strands of 
attB 1 adapter are as follows; attB2-dT primer, 5'-GCGAAGCCC ACCACTTT- 
GTACA AGAAAG€TGGGC GGCCGC(T) 2(r 3 / ; upper and lower strands of attBl 
adapter, 5'-TCG ACG CGTAC A AGTTTGTAC A A A A A AG C AGGCTCTTC- 3 f 
and 5'-G A AG AGCCTG CTTTTTTGTAC AA ACTTGT ACGCG -3', where the 
attBl and attB2 sequences are underlined. An exacted reaction product using 
phosphorylatcd attB 1 adapter is illustrated. In the phosphorylaled attB 1 adapter, 
the 5'-end of only the lower strand of the atlBl adapter was chemically phos- 
phorylated. 



and LC were 5,0 x 10 s cXu./p,g and 3.6 x 10 7 c.f.uVu.g of the 
cDNA insert, respectively. 

To analyze the size distribution of the mixture of pSp3 
cDNA inserts thus obtained, we prepared cDNA plasmids from 
>2 x 10 4 colonies of these two libraries. These plasmid 
mixtures were digested with restriction enzymes that do not cut 
internally in pSp3: Not\ to linearize the plasmids, or Noll plus 
EcoRl to release the inserts. A Notl site is located between the 
3'-end of cDNA and attL2 site, and two EcolU sites are 
present, one 145 bp upstream of the 5 '-end of cDNA (in the 
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Figure 3. Comparison of size bias effects imposed on cloning of a 7.8 kb |JSp3 
cDNA by LC and RC. RC and LC shown above lanes in (A) and (15) indicate 
recovered cDNA plasmids prodticed by RC and by LC, respectively. DNAs 
were separated on a 0.7% agarose gel, stained with SYBR-Green I, transferred 
onto a nylon membrane and hybridized with a J3 P-1abeled oligonucleotide 
probe specific to the 3'-eud sequence of psp3. (A) and (B) show the DNA 
staining patterns and the hybridization pattern of (3Sp3 cDNA plasmids 
digested with Noil and Noll plus EcoRl, respectively. In (B) the hybridization 
pattern of the gel-recovered pSp3 cDNA before cloning is also shown. Size 
marker DNAs (1 kb Plus DNA Ladder, Life Technologies) were run in lane M. 
(C) Distribution of hybridization signals along each lane of the gel-recovcrcd 
psp3 cDNA and cDNA plasmids in (B). Siars in (A) and (B) indicate the 
position of linearized pSPORT-1. 



multiple cloning site of pSPROT-1) and the other 330 bp 
downstream of the 3'-end of cDNA. Figure 3 displays the 
staining patterns and the pSp3-specific hybridization patterns 
of the cDNA samples digested with Notl (Fig. 3 A) and Notl 
plus EcoKl (Fig. 3B). Although all these plasmids were deriv- 
atives of pSPORT- 1 vector, the vector portion, of RC clones is 
-450 bp larger than that of LC clones due to the presence of 
attL sites and their flanking sequences. In addition, because 
attLl sites are present in the excised inserts from RC clones, 
actual cDNA insert sizes of RC clones are -100 bp smaller 
than those estimated from migration rates of Notl-EcoRl frag- 
ments on an agarose gel. For RC clones, a small fragment 
carrying the attL2 site (-330 bp) was also excised from the 
vector, which is seen as a faint band in the DNA staining 
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pattern of the RC sample in Figure 3B. Taking these differ- 
ences in plasmid structure into account, we carefully examined 
the staining and the hybridization patterns of RC and LC 
clones. It is notable that the clones yielded by LC and RC are 
clearly different: cloning of pSp3 cDNA (>4 kb) by LC gener- 
ated considerable amounts of spurious clones, because plas- 
mids smaller than pSPORT-1 vector (which would migrate at 
the position indicated by the star in Fig, 3A and B) and without 
hybridization signals are apparent in Figure 3 A (indicated by a 
brace). In contrast, the lanes of RC products show little or no 
evidence of such spurious clones. In Figure 3A and B, (3Sp3 
cDNA plasmids yielded by LC and RC were loaded on the gel 
so as to give approximately the same signal intensities of full- 
length cDNA (7.8 kb). Thus, the stronger bands of the LC 
sample in Figure 3A and B indicate that the full-length content 
of LC clones was lower than that of RC clones. The size distri- 
bution of cloned [5Sp3 cDNA was quantitatively examined by 
monitoring the radioactive hybridization signals along the 
lanes in (B) (Fig. 3C). From the quantitation of the hybridiza- 
tion signals, the contents of the full-length (3Sp3 cDNA were 
estimated to be 7% in the RC system and 4% in the LC system, 
while the insert DNA before cloning contained f 3% of the full- 
length pSp3 cDNA. These results confirm that RC caused less 
size bias than LC in actual cDNA cloning. Nine clones 
randomly sampled from the RC library were sequenced from 
both the attL sites, and all were found to carry inserts in the 
expected orientation (data not shown). 

Construction and characterization of rat brain cDNA 
library prepared by the RC system 

The results described, above show that RC offers several 
advantages over conventional LC, particularly for large 
cDNAs. In the next step, we applied RC to the construction of 
an actual cDNA library, prepared from rat brain poly(A)+ 
RNA, and analyzed the resultant cDNA clones. Double- 
stranded rat brain cDNAs after attB 1 adapter ligation were 
size-fractionated on an agarose gel and the cDNAs >1 kb were 
subjected to RC. The average cloning efficiency of rat brain 
cDNAs (>I kb) was 6.0 x 10 8 c.f.u./^g of cDNA insert in three 
independent experiments (ranging from 2.3 x 10 s c.f. u./ug to 
8.6 x !0 8 c.f.u./u.g). The transformation with attP pSPORT-1 
plasmid alone gave 10 4 c.Lu./flg of vector. When we character- 
ized ten randomly sampled cDNA clones by DNA sequencing 
from both attL sites, all the clones carried cDNA inserts 
flanked by the recombination sites in the expected orientation 
(i.e., attL! and attL2 sites were located at the 5'- and 3'-ends, 
respectively). Although our preliminary experiments had 
suggested that phosphorylated attB 1 adapters were ligated with 
most cDNAs at both ends before the recombination reaction 
(data not shown), these results thus indicate that the BxP 
reaction can use DNAs bearing two attB 1 sites and one attB 2 
site as substrates to faithfully generate expected recombination 
products. 

To compare the RC method to the conventional LC method 
in actual cDNA cloning, we prepared two cDNA libraries, one 
by theRC- and the other by the LC-assisted method, using the 
same lot of rat brain cDNA in each case. For LC, the cDNA 
was digested first with Notl then size fractionated on an 
agarose gel to remove cDN As smaller than 1 kb, prior to liga- 
tion with vector; for RC, the cDNA was directly size fraction- 
ated, prior to use in the BxP recombination reaction. Once 



again, the cloning efficiency obtained with the RC method was 
-6 x I0« c.f.u./jig of cDNA insert, whereas the cloning effi- 
ciency by LC with the same cDNAs after Notl digestion was 
1 ,5 x 10 8 c.f.u./pg. In this LC experiment, we used the Sail 
adapter provided in the kit in place of the attB I adapter as 
described in Materials and Methods because the Sail adapter 
gave a higher LC efficiency than the attB 1 adapter. 

We next compared the size distribution of cDNA inserts 
cloned by RC and LC- For this, cDNA plasmids were digested 
with Notl, run on an agarose gel and the fluorescence intensity 
of the linearized plasmid was monitored along each lane. In 
this comparison, we adjusted the calculated insert sizes to 
account for the 500 bp larger size of the RC vector. Overall, a 
similar distribution of insert sizes was obtained with RC and 
LC. Because most of rat brain cDNAs were between 1 and 
2 kb, the difference in size bias between RC and LC was diffi- 
cult to see clearly using these small cDNAs. In recent experi- 
ments (O.Ohara and T.Watanabe, unpublished), we have found 
that a lower size bias of RC is more apparent when cloning 
cDNA populations >3 kb, consistent with the results of our 
model experiment cloning 7.8 kb rat £Sp3 cDNA by LC and 
RC, described above. 

Conventional LC for directional cDNA library construction, 
in contrast to RC, includes restriction digestion of cDNAs with 
a rare cutter, like Notl, prior to cloning. We sought to demon- 
strate that cDNAs containing internal Notl sites could be 
cloned with RC (Table 1). For this analysis, rat brain super- 
coiled cDNA was fractionated on an agarose gel to enrich for 
clones harboring inserts of 3-4 kb, because frequencies of 
clones having internal Notl sites should be higher in large 
cDNAs than in small cDNAs. Then the size- fractionated DNA 
was introduced into E.coli, and plasmids were prepared from 
192 randomly picked colonies. According to the cDNA 
construction method used in this study, all the clones should 
have one artificially generated Notl site at the 3'-end of the 
cDNA insert. Notl digestion is thus expected to yield single 
bands of linear cDNA plasmids, unless additional Notl sites are 
present internally. Therefore the appearance of multiple bands 
after Notl digestion on agarose gel electrophoresis indicates 
the presence of internal Notl sites in a cDNA insert. 

Nine of the 192 cDNA clones examined (4.7%) contain an 
internal Notl site, and eight of the 100 clones (8%) with inserts 
>2 kb have internal Notl sites (Table 1). DNA sequencing 
revealed that all nine clones were independent. Two of these 
clones contain chimeric inserts, which are readily discerned by 
the presence of an additional attB2-oligo-dT-adapter-primer 
sequence (harboring the second Notl site) immediately 
following the expected attLl sequence. The arrangement of the 
recombination sites in these chimeras suggests that they arose 
by 5'-end-5'-end ligation of cDNA-attB2-oligo-dT molecules 
during ligation of the attB 1 adapters, following second strand 
synthesis (data not shown). Three of the 192 clones have no 
Notl site. Two of these are unreacted attP-pSP73 donor plas- 
mids, and the remaining clone contains a mutation within the 
Notl site of the atlB2 dT adapter-primer. Taken together, these 
data (i) confirm RC cloning of cDNAs containing internal Notl 
sites and (ii) verify a low incidence of chimeric inserts (1%) 
and empty vector (1%), Finally, we classify two of the 192 
clones as plasmid dimers, because Notl digestion halved their 
sizes, yielding single bands on agarose gels, and the sizes of 
the inserts excised from the vector were consistent with the 
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sizes estimated in a Notl-digesteti form rather than those in a 
covalently closed circular form (data not shown). Overall, the 
incidence of internal Noli sites is six out of the 97 clones with 
inserts >2 kb. Additional results demonstrate that the molar 
content of cDNA >4 kb in a rat brain double-strand cDNA pool 
was reduced by Noll digestion from 16 to 10% (O.Ohara and 
T.Watanabe, unpublished). 

Table 1. Frequency of occurrence of interna] Notl sites in rat brain cDNA 
clones obtai ned by RC 



cDNA Insert size (bp) :i Number of Notl sites per cDNA 



I 2 0 Total 



500-1000 


15(2) b 


1 


0 


16 


1000-2000 


76 


0 


1 


77 


2000-3000 


19 


2 


2 


23 


3000-4000 


57 


4(2y 


0 


61 


4000-5000 


13 


1 


0 


14 


5000-6000 


0 


I 


0 


1 


Total 


180 


9 


" 3 d 


192 



"Because plasm ids harboring 3-4 kb cDNA inserts in a snpercoiled form 

co-migrated with those harboring 1-2 kb cDNAs in art open circular form on 

agarose gel electrophoresis, die size distribution of analyzed cDNA inserts 

had two peaks, at 1-2 and 3^1 kb, as shown. 

"The number of plasmids found as dtmers is shown in parentheses. 

c Thc number of chimeric clones is shown in parentheses. 

d One plasmid carried a mutation at the posiiion corresponding to a Noll site 

whiic two other plasmids were accidentally recovered attP donor plasmids. 



DISCUSSION 

Prior descriptions of RC have emphasized its utility for rapid 
and efficient cloning of PCR products, together with their 
parallel subcloning into multiple types of expression vectors 
(9,10). In the present study, wc evaluated several potential 
advantages for directional cDNA cloning that RC should 
provide over conventional methods using LC. These include 
directional cloning without use of restriction enzymes and 
lower levels of chimeric inserts. These advantages derive from 
the different cutting and joining mechanisms employed by 
these two approaches. 

The relatively long (-25 bp) recombination sequences used 
for RC should occur by chance very rarely, if at all, in any 
population of cDNA. Consequently, internal cleavage of 
cDNA insert sequences by the RC reaction should be negli- 
gible. In contrast, restriction enzyme cleavage sites, even for 
rare cutting enzymes, exist much more commonly in a large 
population of cDNA molecules. Internal cleavage. of cDNAs 
becomes a more serious concern in the cloning and analysis of 
large cDNAs, since the occurrence of internal rare restriction 
sites in cDNA increases with increasing cDNA size. Consistent 
with this, our results (Table 1) demonstrate Notl cleavage sites 
in -6% (6 out of 97) randomly sampled cDNA clones with 
inserts >2 kb, as expected if one Notl site occurs roughly every 
4 8 bp. Although other LC-based cDNA library construction 
methods may avoid this problem (for example see 16), the 
simplicity, robustness and high cloning efficiency of the 
Gubler-Hoffman protocol continue to make this a popular 



method (5). Our results indicate that RC-based cDNA library 
construction resolves this problem without sacrificing any of 
the advantages of conventional cDNA cloning methods. 

Another important difference between RC and LC for prepa- 
ration of cDNA libraries is the level and the structure of 
chimeric cDNA inserts. The attB recombination sites on cDNA 
molecules used in RC can recombine only with respective attP 
sites on cloning vectors, but not with attB sites on other cDNA 
molecules as demonstrated in a model experiment described in 
this study. In contrast, the cDNA inserts employed in LC can 
react not only with the ends of vectors, but with other cDNA 
molecules as well. Thus LC inevitably promotes some level of 
random concatenation of both insert cDNAs and vector, 
leading to significant levels of chimeric inserts. Although RC 
does not include any step expected to generate chimeric 
inserts, we observed chimeric clones in rat brain cDNA library 
constructed by RC-assisted method at a rate of ~ \ %. The struc- 
ture of these chimeric clones suggests that they were generated 
as by-products of attB 1 adapter ligation, not by the BxP reac- 
tion. By increasing the amount of attB I adapter and/or dephos- 
phorylating the cDNA-attB2 molecules prior to ligation to the 
attB I adapter, it should prove possible to reduce the frequency 
of chimeric clones even further. An important point is that 
chimeric clones arising in cDNA libraries constructed by 
conventional LC approaches are difficult to detect by end 
sequencing alone. In contrast, our ability to easily identify 
these chimeric clones in the RC-assisted library only by end- 
sequencing of cDNAs is a significant benefit in using this 
library for comprehensive cDNA analysis. 

RC also addresses the difficulties inherent with the subse- 
quent transFer of large cDNAs from one vector to another, such 
as for functional analysis. Large cDNAs frequently amplify 
inefficiently by PCR, and finding appropriate restriction sites 
with which to excise large cDNA inserts from a vector can be 
problematic. But when cDNAs are introduced into vectors 
using RC, their subsequent transfer to other vectors by recom- 
bination is fast and efficient, making it possible to subclone 
large cDNAs as easily as small cDNAs into a variety of vectors 
(9,10). Therefore, RC is a method highly suitable for both the 
cloning and analysis of large cDNAs, which have remained 
elusive targets in conventional cDNA analysis. 

In previous studies, open reading frames were introduced 
into the RC system either by PCR with primers bearing attB 
(25 bp) recombination sites or by LC into a vector with attL 
recombination sites (9,10). The present method therefore 
provides an efficient new route to introduce cDNAs into the 
RC system, which avoids the need for restriction enzyme 
digestion and minimizes the incidence of chimeric clones. To 
date, we have successfully constructed several different cDNA 
libraries by this method, and used these for comprehensive 
cDNA analysis. 
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